Modeling Spammer Behavior: Artificial Neural Network vs. Naïve Bayesian Classifier

نویسندگان

  • Md. Saiful Islam
  • Md. Rafiqul Islam
چکیده

The exponential growth of spam emails in recent years is a fact of life. Internet subscribers world-wide are unwittingly paying an estimated €10 billion a year in connection costs just to receive "junk" emails, according to a study undertaken for the European Commission. Though there is no universal definition of spam, unwanted and unsolicited commercial email as a mass mailing to a large number of recipients is basically known as the junk email or spam to the internet community. Spams are considered to be potential threat to Internet Security. Spam's direct effects include the consumption of computer and network resources and the cost in human time and attention of dismissing unwanted messages. More importantly, these ever increasing spams are taking various forms and finding home not only in mailboxes but also in newsgroups, discussion forums etc without the consent of the recipients. Overflowing mailboxes are overwhelming users, causing newsgroups and discussion forums to be flooded with irrelevant or inappropriate messages. As a consequence, users are getting discouraged not to use them anymore though these systems can provide numerous benefits to them. Combating spam is a difficult job contrast to the spamming. Millions of spammers around the world are engaged in spreading spams with ever changing tricks and tactis to circumvent the filters deployed by the mailbox providers. As spammers are paid per volume for thier job, they invest thier best effort in reaching everyone by all possible ways. No antispamming technique is hundred percent accurate for spam problem. Antispamming techniques try to make a trade-off between rejecting legitimate e-mail vs. not rejecting all spam, and the associated costs in time and effort. The simplest and most common approaches are to use filters that screen messages based upon the presence of common words or phrases common to junk e-mail. Other simplistic approaches include blacklisting and whitelisting. • Blacklisting technique automatically rejects messages received from the addresses of known spammers. • Whitelisting accepts messages received from known and trusted correspondents only. The major flaw in the first two approaches is that it relies upon complacence by the spammers by assuming that they are not likely to change (or forge) their identities or to alter the style and vocabulary of their sales pitches. Whitelisting risks the possibility that the

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine Learning Approaches for Modeling Spammer Behavior

Spam is commonly known as unsolicited or unwanted email messages in the Internet causing potential threat to Internet Security. Users spend a valuable amount of time deleting spam emails. More importantly, ever increasing spam emails occupy server storage space and consume network bandwidth. Keyword-based spam email filtering strategies will eventually be less successful to model spammer behavi...

متن کامل

EFFECTS OF THE HARDENED NICKEL COATING ON THE FATIGUE BEHAVIOR OF CK45 STEEL: EXPERIMENTAL, FINITE ELEMENT METHOD, AND ARTIFICIAL NEURAL NETWORK MODELING

Hardened nickel coating is widely used in many industrial applications and manufacturing processes because of its benefits in improving the corrosion fatigue life. It is clear that increasing the coating thickness provides good protection against corrosion. However, it reduces the fatigue life. Thus, applying a thin layer of coated nickel might give an acceptable corrosion protection with minim...

متن کامل

Naïve Bayes vs. Decision Trees vs. Neural Networks in the Classification of Training Web Pages

Web classification has been attempted through many different technologies. In this study we concentrate on the comparison of Neural Networks (NN), Naïve Bayes (NB) and Decision Tree (DT) classifiers for the automatic analysis and classification of attribute data from training course web pages. We introduce an enhanced NB classifier and run the same data sample through the DT and NN classifiers ...

متن کامل

A Novel Methodology for Constructing Rule-based Naïve Bayesian Classifiers

Classification is an important data mining technique that is used by many applications. Several types of classifiers have been described in the research literature. Example classifiers are decision tree classifiers, rule-based classifiers, and neural networks classifiers. Another popular classification technique is naïve Bayesian classification. Naïve Bayesian classification is a probabilistic ...

متن کامل

Modeling of Compressive Strength of Metakaolin Based Geopolymers by The Use of Artificial Neural Network RESEARCH NOTE)

In order to study the effect of R2O/Al2O3 (where R=Na or K), SiO2/Al2O3, Na2O/K2O and H2O/R2O molar ratios on the compressive strength (CS) of Metakaolin base geopolymers, more than forty data were gathered from literature. To increase the number of data, some experiments were also designed. The resulted data were utilized to train and test the three layer artificial neural network (ANN). Bayes...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012